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PROTEOMIC INTERACTION ARRAYS 

RELATED APPLICATIONS 

This application is a continuation of International Application No.: 
PCT/USOO/02974, filed on February 4, 2000, which claims the benefit of U.S. 
5 Provisional Apphcation No. 60/1 18,901 , filed February 5, 1999, the teachings of which 
are incorporated herein by reference in their entirety. 

BACKGROUND OF THE INVENTION 

The identification of new pathways involving protein-protein interaction in 
disease states has a high commercial value because of their potential to identify 

1 0 therapeutic targets. A variety of procedures have been developed to identify interactions 
between proteins. Three common biochemical methods to screen for interacting 
proteins are, co-immunuprecipitation, affinity chromatography, and expression library 
screening. Coimmunprecipitation is one of the most common biochemical methods to 
search for interacting proteins. For example, the well-known interaction between 

1 5 retinoblastoma protein (p 1 1 0^) and adenoviral protein E 1 A was obtained using this 

approach (Whyte et al. Nature 334:124-129 (1988)). Affinity chromatography typically 
involves a bait protein linked to beads. Proteins that interact with a bait protein bind the 
bait and are eluted after washing the column. A typical application is to use glutathione- 
s-transferase protein (GST) fiised to a polypeptide of interest as the bait protein. 

20 Expression library screening involves screening the library using a labeled bait protein 



as a probe. Expression library screening has been successful in identifying genes 
encoding proteins that interact with calmodulin, jun, myc, the EGF receptor and the 
retinoblastoma protein. Although these methods have generated many significant 
results, they are all in vitro methods and are generally difficult to modify for high 
throughput. Furthermore, once the interacting proteins are identified, the cloning of its 
corresponding gene could be very challenging and not easily engineered for high 
throughput analysis on discovery. 

An in vivo genetic approach to detect protein-protein interactions is the yeast 
two-hybrid system. The two-hybrid system is typically applied to detect the interaction 
between two proteins or to isolate interacting proteins fi*om a library using a specific 
bait. 

The two-hybrid system permits an in vivo identification of the interacting 
proteins. Hence, the conformation of the target protein in yeast cells is closer to the 
native form than most of the in vitro conditions that are available, and reasonably it is 
therefore more likely to yield physiologically significant proteins. It is likely to be 
more sensitive for detection of protein-protein interaction than many other methods, 
such as probing an expression library with a labeled protein or co-immunoprecipitation, 
based on the parallel comparisons (Li et al, FASEB J. 7:957-963 (1993)). This 
sensitivity allows the isolation of weaker or transiently interacting proteins. Numerous 
protein interactions have been successfully detected by using the two-hybrid system, 
including cell cycle factors, signal transduction factors, proteins involved in apoptosis 
and DNA repair. 

The use of the two-hybrid approach to determine protein-protein interaction 
rapidly and on a large scale has certain obstacles. Modification for high throughput 
analysis of protein-protein interaction or high throughput identification of novel 
interacting proteins requires a tremendous amount of labor intensive subcloning of gene 
sequences of interest and sequencing of newly discovered candidate genes fi-om 
libraries, making high throughput two-hybrid approaches time consuming and 
expensive. Traditionally, two-hybrid screens are performed using a single bait protein, 
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screening nucleic acids that encode potentially interacting proteins, retrieving the clones 
encoding the interacting proteins, sequencing these clones, and storing the sequence 
information in silico. The sequencing cost is extremely high when a genome-wide two- 
hybrid screen performed. 
5 Furthermore, it has been estimated that the human genome encodes 100,000 

proteins with potentially 50 billion different protein-protein interactions. However, not 
all of these interactions are expected to be involved in pathways that are either involved 
in disease or development or are suitable drug targets. Thus, analysis of all two-hybrid 
interacting pairs involves considerable time, effort and expense for interactions that may 

1 0 have little or no commercial or research value. 

Another limitation of the traditional two-hybrid approach has been the inability 
to reconstitute interactions mediated by several components or interactions that are 
dependent on specific post-translational modifications. Several assays have been 
described to overcome this barrier, including co-expression of a protein tyrosine kinase 

15 as a modifying enzyme to assess the interactions between phosphoproteins. However, 
these studies typically focus on a single bait protein and the interactions in the presence 
of the third protein, either as a modifier or stabilizer. 

Further, limitations of the two-hybrid approach are the presence of false 
positives and false negatives. Although improvements have been implemented to 

20 reduce the number of false positives, the problem still exists. From the bacteriophage 
T7 protein linkage mapping project, it was found that the large majority of false 
positives appear to be due to transcriptional activation from the DNA binding domain 
clones in the absence of protein-protein interaction. That is to say, the DNA-binding 
domain hybrid (BD-X) activates the reporter gene by itself Li addition, false positives 

25 can result from the non-specific interaction via short stretches of residues. 

SUMMARY OF THE INVENTION 

The present invention is drawn to a method of selecting nucleic acids encoding 
polypeptides, wherein the polypeptides are capable of interacting, in vivo (e.g., in a 



living cell) with a polypeptide of interest. The method comprises providing or 
generating an array of plasmid partners and probing the array with nucleic acid encoding 
a polypeptide of interest. Polynucleotides encoding a polypeptide that interacts with 
the polypeptide of interest are identified by contacting the array with polynucleotide 
probe encoding all or a portion of said polypeptide of interest under conditions where 
the probe detectably hybridizes to complementary sequence, if present, within any of the 
plasmids of the array. Plasmid partner or partners of the hybridized plasmid are 
identified and optionally isolated, wherein said partner or partners encode a polypeptide 
capable of interacting, under physiological conditions, with the polypeptide of interest. 

The plasmid partners comprise two or more plasmids wherein each plasmid 
comprises a polynucleotide sequence encoding a polypeptide fused or linked to nucleic 
acid sequence encoding a DNA binding protein domain or a transcriptional activation 
domain. The plasmid partners are selected to be in the array by their ability to, in 
concert, produce a detectable biochemical readout, e.g., transcription of one or more 
marker genes in a host cell. 

The present invention is drawn to a method of isolating polynucleic acids 
encoding at least one polypeptide capable of interacting, in vivo with a polypeptide of 
interest comprising: 

a) contacting at least one array of plasmids with a probe, wherein: 

i) said probe encodes the polypeptide of interest or fragment 
thereof, wherein said probe hybridizes to complementary 
sequence, if present, within any of the plasmids, and wherein: 

ii) said array comprises two or more plasmid partners, wherein a 
first plasmid partner comprises a first library fiised to a first 
nucleic acid sequence encoding a first half of a selection pair and 
a second plasmid partner comprises the same or a second library 
fiised to a second nucleic acid sequence encoding a second half of 
a selection pair and wherein the plasmid partners are selected to 



be in the array by their ability to, in concert, activate the selection 

pair in a host cell, and 
b) identifying the partner or partners to said hybridized plasmid, 
wherein said partner or partners encode a polypeptide capable of interacting, in 
vivo with the polypeptide of interest. 

The present invention is drawn to a method of generating an array of plasmids 
comprising: 

a) conducting a two-hybrid screen, wherein bait plasmids comprise a cDNA 
library and a prey plasmids comprise the same or a second cDNA library; 

b) selecting positive two-hybrid clones and 

c) immobilizing the bait and prey plasmids from said positive clones on a 
soHd support at known locations, thereby generating an array of 
plasmids. 

Li another embodiment of the present invention, a method is provided for 
selecting polypeptides capable of interacting in vivo with a polypeptide of interest, 
wherein the interaction is dependent upon post-translational modification. The method 
comprises providing or generating arrayed sets of plasmids comprising three or more 
plasmid partners, wherein a first plasmid partner comprises a library of sequences 
encoding polypeptides fused or linked to nucleic acid sequence encoding a DNA 
binding domain and wherein a second plasmid partner comprising the same or a second 
library fused or Unked to nucleic acid encoding a transcriptional activation domain and 
wherein a third plasmid partner comprises at least one post-translational modifying 
enzyme. The expression of said enzyme is optionally under the control of an inducible 
transcription system. The first and second plasmid partners are selected by their ability, 
in concert, and in the presence of the expressed post-translational modifying enzyme, 
activate transcription of one or more marker genes in a host cell. Polypeptides that 
interact with the polypeptide of interest are selected by contacting nucleic acid 
encoding the polypeptide of interest to the array, under conditions where the nucleic 
acid detectably hybridizes to complementary sequence, if present, within any of 
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plasmids. Plasmid partner or partners of the hybridized plasmid are identified and 
optionally isolated, wherein said partner or partners encode a polypeptide capable of 
interacting, in vivo and in a post-translational modification dependent manner, with the 
polypeptide of interest. 
5 In another embodiment of the present invention, a method is provided for 

selecting polypeptides capable of interacting under physiological conditions, with a 
polypeptide of interest, wherein the interaction is inhibited by post-translational 
modification. The method comprises providing or generating arrayed sets of plasmids 
comprising three or more plasmid partners, wherein a first and second plasmid partner 

10 as described above and a third plasmid partner comprising at least one post-translational 
modifying enzyme. The expression of said enzyme is optionally under the control of an 
inducible transcription system. The first and second plasmid partners are selected by 
their ability to, in concert and in the absence of the post-translational modifying enzyme, 
activate transcription of one or more marker genes in a host cell. The array is probed 

15 with a polynucleotide encoding polypeptide of interest as described above. 

The present invention provides a powerful tool to generate complete linkage 
map of proteins encoded by the cDNA library used to create the plasmid partner array. 
This information can be stored in the form of a DNA chip, avoiding the high cost of 
sequencing all clones. The method of the present invention is particularly suitable for 

20 high throughput operation. The archive of arrays of the present invention allows 

identification and retrieval of the X, Y, or both sequences using standard hybridization 
techniques. A sequence of interest is used to probe the arrays of the Unkage map by 
hybridization. A positive signal reveals both the sequence homologous to the probe as 
well as the partner plasmid. By using the linkage map of the present invention, one can 

25 probe the map with a sequence encoding a protein of interest, which hybridizes 

selectively to complementary sequences, when present, in the X or Y inserts and find a 
corresponding plasmid that encodes a protein that interacts with the protein of interest. 
For example, if the sequence encoding the protein of interest hybridizes to X^, then the 
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location of Yi in the array will be provided by the linkage map. The or Y^, sequence 
provided by the map can be used to probe for other interacting proteins. 

In the method of the present invention, any pair of DNA-binding domains and 
activation domains can be used. Furthermore, any site-specific transcription factor that 
5 has separable DNA binding domain and activation domain can be used. Other methods 
to identify protein-protein interactions in the two hybrid-derived system include other 
reconstitutive methods whereby two domains that together produce a biochemical 
readout can be physically separated such that the reassociation of the separated domains 
via protein X and Y interaction reconstitutes the biochemical readout; for example, as 

1 0 reviewed by Mendelsohn and Brent (the teachings of which are incorporated by 
reference herein in their entirety). The membrane binding and catalytic domains of 
guanine exchange factor can be used (Mendelsohn and Brent, Science 25-^:1948-1950 
(1999)). These separated domains are referred to herein as a first and second half of a 
selection pair, respectively. 

15 The present invention is advantageous over other biochemical methods for a 

number of reasons. The present invention uses arrays to store all protein-protein 
interaction information identified using two-hybrid screening without performing a 
large amount of subcloning or sequencing. Instead, the positively interacting pairs are 
identified using probes selected by the user to hybridize the array and identify all 

20 locations of the array that contain DNA encoding the protein of interest and partner 
plasmids in the same or linked array that encode proteins that interact with said protein 
of interest. Therefore, the present invention significantly reduces the cost of mapping 
protein interactions on a cellular, tissue or genome wide scale. Moreover, the arrays of 
the present invention provide an archive in which a user can obtain the interaction 

25 partners rapidly by DNA hybridization; a much faster and simpler technique than yeast 
two-hybrid screen. A database of interacting partners can be produced by storing the 
identity of partner pairs identified by hybridization in computer table. The modification 
of using different selection markers on bait and prey vectors in the two-hybrid system 
simplifies the plasmid DNA recovery process and speeds up the entire two-hybrid 



screening procedure. In addition, the process of generating the array, probing the array 
and identifying plasmids encoding interacting pairs of proteins can be automated. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic diagram of the generation of Proteomic Interaction 

Arrays. 

Figure 2 is a diagram of two plasmids for use in the two-hybrid screen of the 
present invention. 

Figure 3 is a schematic diagram of the generation of Proteomic Interaction 
Arrays using three protein, two-hybrid (3PTH). 

Figure 4 shows the hybridization of vector sequences to the test array of 
Example 1. 

Figure 5 shows the hybridization of an Snk probe to the test array of Example 1. 
Figure 6 shows the hybridization of an B II probe to the test array of Example 1, 

DETAILED DESCRIPTION OF THE INVENTION 

The human genome sequencing project has had a revolutionary effect on 
biological research. The decoding of all genes brings the blueprint of hfe to scientists, 
while the function of genes still remains a mystery. Since virtually all cellular processes 
are controlled by proteins, including disease processes and developmental processes, 
knowledge of how proteins interact is necessary to understand these processes. 

The present invention relates to a novel high throughput approach to two-hybrid 
analysis. The yeast two-hybrid system is a genetic approach which allows one to detect 
protein-protein interaction in vivo through the reconstitution of the activity of a 
transcriptional activator, such as GAL4, in yeast Saccharomyces cerevisiae. The key of 
the two-hybrid system is the finding that site-specific transcription factors are often 
modular, comprised of separable DNA-binding domains (BDs) that bind to a specific 
promoter sequence, and activation domains (ADs) that direct the RNA polymerase II 
complex to transcribe the gene downstream of the DNA binding site. This phenomenon 



is exploited by fusing separate binding and activation domains to a pair of interacting 
proteins, X and Y, to create two hybrid proteins, BD-X and AD-Y. If the X and Y 
proteins interact, co-expression of two hybrids in a yeast cell leads to expression of a 
reporter gene containing the cognate BD-binding site. This approach can be also used 
to isolate cDNAs encoding partners for a protein of interest from an AD-Y library. 

The present invention is drawn to a genome-wide scale two-hybrid screen. The 
method of the present invention involves a rapid plasmid retrieval system to recover 
plasmids from two-hybrid positive cells, an array system to store the interaction 
information using suitable substrates such as filter membranes or microarrays (eg. on 
glass slides), and a hybridization method to identity nucleic acid encoding proteins that 
interact with a polypeptide of interest. The present invention also relates to cDNA 
libraries constructed in suitable two-hybrid plasmids, such as the two plasmids shown in 
Figure 2. 

The present invention is drawn to a method to quickly identify DNA encoding 
proteins which interact with a protein of interest, by probing the arrays of the present 
invention with nucleic acid encoding the protein of interest. The method involves 
construction of an array of protein-protein interactions represented by plasmid pairs, 
wherein the plasmid pairs have been selected from a genome-wide scale two-hybrid 
screen. The collection of plasmid pairs is referred to herein as an ''interaction library" 
or array, hi this embodiment, the first and second plasmid pairs are generated using the 
cDNA library from a cell hne or tissue of interest. In another embodiment, the array 
represents the entire complement of protein-protein interactions of an organism. The 
present invention can also be appHed to other screens, such as yeast three-hybrid, one- 
hybrid, and mammalian two-hybrid screen. 

hi the method of the present invention, positive yeast hybrids from the two- 
hybrid assay are selected and distributed in an array, such as a two dimensional array. 
The clones are also stored for fiiture access. Each yeast hybrid is a clone containing at 
least two plasmids. To identify nucleic acid encoding pol>peptides that interact with a 
polypeptide of interest, the plasmids from the yeast clone array are transferred to a solid 
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support for hybridization screening. The plasmid array is probed with a nucleic acid 
selected by a user. In one embodiment, the nucleic acid encodes a protein of interest. 
The array is contacted with the probe under suitable hybridization conditions. The wells 
or array locations containing nucleic acid homologous with specific probe are identified 
by hybridization with the probe. Vectors or plasmids encoding interacting partner(s) of 
the protein of interest are also identified. After identification of the vectors encoding 
interacting partners, the process can be repeated with the newly identified polynucleic 
acid molecules to reveal polynucleic acids encoding proteins that interact with the 
previously identified interacting partners. Thus, the method and arrays of the present 
invention provide networks of interaction pathways within a cell. 

The skilled artisan will recognize that factors commonly used to impose or 
control stringency of hybridization include formamide concentration (or other chemical 
denaturant reagent), salt concentration (i.e., ionic strength), hybridization temperature, 
detergent concentration, pH and the presence or absence of chaotropes. Optimal 
stringency for a probe/target combination is often found by the well known technique of 
fixing several of the aforementioned stringency factors and then determining the effect 
of varying a single stringency factor. Optimal stringency for hybridizing the user 
defined probe to the array may be experimentally determined by examining variations of 
each stringency factor until the desired degree of discrimination between specific and 
non-specific sequences has been achieved. The level of stringency will increase or 
decrease depending on whether the target and variable regions are complementary or 
substantially complementary. 

A general description of stringent hybridization conditions is provided in 
Ausubel, F.M., et aL, Current Protocols in Molecular Biology, Greene Publishing Assoc. 
and Wiley-Interscience 1989, the teachings of which are incorporated herein by 
reference. The influence of factors such as probe length, base composition, percent 
mismatch between the hybridizing sequences, temperature and ionic strength on the 
stability of nucleic acid hybrids is well known in the art. Thus, stringency conditions 
sufficient to allow the user defined probes to hybridize with specificity to a homologous 



nucleic acid sequence, if present, in the array can be determined empirically. The probe 
need not hybridize to the nucleic acid sequence of interest with exact complementarity, 
so long as the target nucleic acid sequence of interest is identical or nearly identical to the 
probe, e.g., a homolgue or variant. 

Conditions for stringency are also described in: Secreted Proteins and 
Polynucleotides Encoding Them, (Jacobs et al, WO 98/40404), the teachings of which 
are incorporated herein by reference. In particular, examples of highly stringent, 
stringent, reduced and least stringent conditions are provided in WO 98/40404 in the 
Table on page 36. In one embodiment of the present invention, highly stringent 
conditions are those that are at least as stringent as, for example, Ix SSC at 65"C, or Ix 
SSC and 50% formamide at 42°C. Moderate stringency conditions are those that are at 
least as stringent as 4x SSC at 65°C, or 4x SSC and 50% formamide at 42°C. Reduced 
stringency conditions are those that are at least as stringent as 4x SSC at 50°C, or 6x SSC 
and 50% formamide at 40°C. 

The present invention expedites the process of discovering novel interaction 
pathways while minimizing the need for subcloning or sequencing polynucleotides that 
do not encode proteins that interact with a polypeptide of interest. 

Also provided by the present invention are two-hybrid plasmids, one containing 
the DNA-binding domain (first plasmid) and the other containing the activation domain 
(second plasmid). These two plasmids use two different E, coli selection markers. 
Useful selection markers for the present invention include for example, genes encoding 
ampiciUin, kanamycin, chloramphenicol, tetracycline, Zeocine and trimethoprim 
resistance. In one embodiment, three selection markers can be used. For example, a 
unique selectable marker can be present on each of the plasmids, such as ampiciUin on 
plasmid one and kanamycin on plasmid two, and a common selectable marker present on 
both plasmids. In another embodiment, one marker can be used a common marker 
present on both plasmids. The markers allow the isolation of the plasmids using 
techniques well known in the art. For example, the plasmids isolated from the two 
hybrid clone can be transformed into E, coli which are grown in the presence of the 
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appropriate antibiotic. Plasmids are purified from the selected E. coli using standard 
techniques in the art. A common marker would allow the isolation of both plasmids in a 
single step, saving time and expense involved in separate isolations. 

In the method of the present invention, a protein-protein linkage map is generated 
5 and stored as an array, without the need for subcloning or sequencing the X or Y inserts. 
The present invention also provides a proteomic array of interactive protein pairs, and 
nucleic acid encoding the pairs and is also referred to herein as an archive. The 
interactive protein pair information can be stored as a database comprising at least one of 
said arrays, coupled with information describing the linkage between the members of 

10 said array. For example, each selected two-hybrid clone harbors a bait plasmid and a 
prey plasmid. These plasmids can be stored together in the same location of the two 
dimensional array, or bait plasmids from all two-hybrid clones can be stored in one array 
or set of arrays while the prey plasmids are stored in a separate array or set of arrays. 
The bait and prey arrays will be hnked by information such that identifying a given bait 

15 or prey plasmid reveals the location of the corresponding prey or bait plasmid, 
respectively. 

The arrays of the present invention can be of any suitable size on any suitable 
substrate, hi one embodiment of the present invention, the arrays are two dimensional 
In another embodiment, the substrate is a plastic tray or plate comprising wells, such as a 

20 96 well or 384 well plate. Said plates are well known in the art. In another embodiment, 
the array can be a series of spots on a substrate, such as nylon membrane, glass slide, or 
photolithographic biochip. The amount of DNA in a given spot can be as httle as 
nanogram quantities, however, less can be used depending on the sensitivity of the 
detection system. The number of spots in an array can be very large depending on the 

25 resolution of the spotting or printing apparatus used. 

The two-hybrid systems and proteomic arrays of the present invention can be 
combined a microarray system. As a result, differentially expressed genes can be 
identified in normal or disease states, or at particular stages of development without 
having to examine 50 billion potential interactions. Methods of making microarrays are 
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well known in the art and methods of identifying differentially expressed genes is 
described in U.S. Serial No. 09/350,609, the teachings of which are incorporated herein 
by reference in their entirety. 

hi another embodiment of the present invention, a method is provided for 
selecting polypeptides capable of interacting in vivo with a polypeptide of interest, 
wherein the interaction is dependent upon post-translational modification (3 protein, 
two-hybrid). The method comprises providing or generating arrays comprising a first 
plasmid partner comprising a library linked to nucleic acid sequence encoding a DNA 
binding domain and a second plasmid partner comprising the same or a second library 
Hnked to nucleic acid encoding a transcriptional activation domain, wherein the first and 
second plasmid partners encode proteins that interact in the presence but not the absence 
of the third plasmid partner comprising at least one post-translational modifying enzyme. 
The expression of said enzyme is optionally under the control of an inducible 
transcriptional system. The first and second plasmid partners are selected by their ability 
to, in concert and in the presence of the expressed post-translational modifying enzyme, 
activate transcription of one or more marker genes in a host cell. Li another 
embodiment, the first and second plasmid partners encode proteins that interact in the 
absence but not in the presence of the third plasmid partner. 

Plasmids or vectors that encode all or a portion of a polypeptide of interest and 
polypeptides that interact with the polypeptide of interest are selected by contacting the 
array under conditions where the probe detectably hybridizes to complementary 
sequence, if present, within any of plasmids of the array. Plasmid partner or partners of 
the hybridized plasmid are identified (e.g., locahzed on the array) and optionally 
isolated, wherein said partner or partners encode a polypeptide capable of interacting, 
under physiological conditions and in a post-translational modification dependent 
manner, with the polypeptide of interest. 

The post-translational modifiers selected for use in the third dimension of the 
3PTH system can be selected from known genes encoding desired members of the 
family. For example, genes encoding several protein kinases have been cloned, such as 
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PKA, S YK, p34cdc2, PKC and PIS kinase and can be readily used in the method of the 
present invention by one of ordinary skill in the art. Similarly, genes for phosphatases, 
glycosylation enzymes and endoproteases have been cloned and can be readily used in 
the method of the present invention. 

In another embodiment of the present invention, the modification enzyme 
encoded by the third plasmid is under the control of an inducible promoter such as 
MET25. Inducible promoters are well known in the art and can readily be incorporated 
into the method of the present invention. Other inducible systems include heat shock, 
GAL and tetracycline inducible promoter systems. 

In one embodiment of the present invention, one or more post-translational 
enzymes are used in the third dimension of the 3PTH system. In a yet another 
embodiment of the present invention, the genes encoding post-translational modifiers of 
the third dimension can be a library of genes encoding such proteins. A family of 
kinases, proteases, glycosylation enzymes or endoproteases representing all or a portion 
of the cellular complement of such proteins can be generated, for example using PGR. 
For example, primers for PGR can be used that recognize known motifs in the class of 
post-translational modifier to be used in order to amplify all or a portion of the sequences 
encoding said modifiers. In one embodiment of the 3PTH method of the present 
invention, the third plasmid partner is not included in the arrays produced. In one 
embodiment, the first and second plasmid partners are selected using selective markers 
that is not present on the third plasmid partner. 

In another embodiment, of the present invention, at least one plasmid partner is 
generated firom a normahzed library. Libraries can be normaUzed, for example, as 
described by Sive and St. John (Nucleic Acids Res. 16:10937 (1988)) and in U.S. Serial 
No.: 60/067,992 the teachings of which are both incorporated herein by reference in their 
entirety. 

The plasmid partners can be generated by fusing the library sequences to DNA 
encoding one half of the selection complex e.g., the DNA binding domain sequence or 
transcription activation sequence such that a fiision protein comprising both segments is 
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expressed. In one embodiment, the fusion is in frame between the coding sequence of 
both segments. However, libraries containing some out of frame fixsions can be used, in 
which case the library is generally made with more clones. In one embodiment of the 
present invention, the library of the first plasmid partner is fused at its 5' end to the 
5 sequence encoding a DNA binding protein. In another embodiment of the present 
invention, the library of the first plasmid partner is fused at its 3' end to the sequence 
encoding a DNA binding protein. Similarly, the Hbrary of the second plasmid partner 
can be fused at either its 5' or 3' end to the sequence encoding the other half of the 
selection complex; for example, if the first plasmid partner uses the DNA binding 
10 domain, then the second plasmid partner uses the DNA transcription domain to generate 
51 the second plasmid partner. 

31 In one embodiment of the present invention, the library comprises cDNA, full- 

length cDNA, genomic DNA or DNA encoding a peptide library. These libraries are 
readily available from commercial sources or can be synthesized using techniques well 

y; 15 known in the art. For example, full length libraries are produced using methods 

described in U.S. Serial No. 09/062,452 the teachings of which are incorporated herein 

I;:,! 

Ill by reference in their entirety. 

CI 

1 :| The DNA binding protein and transcriptional activators used to generate the 

selection complex can be from any known transcriptional activation protein that binds 
20 DNA wherein the DNA binding domain binds in a sequence specific manner. Useful 
transcriptional activators are well known in the art. Particularly useful are those proteins 
where in the DNA activation domain and the DNA binding domain are separable at the 
DNA sequence level. In a further embodiment of the present invention, the DNA 
binding protein domain is selected from the group consisting of GAL4, lexA, GCN4 and 
25 ADRl. In another embodiment of the present invention, the transcription activation 
domain is selected from the group consisting of GAL4, GCN4, ADRl and herpes 
simplex VP 16. 

The DNA binding site is typically placed upstream of a gene encoding a 
selectable marker for the host organism such that protein-protein interaction between the 
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polypeptides encoded by the first and second partner plasmid results in transcription of 
the gene encoding the selectable marker. Selectable markers are well known in the art 
and include, for example, genes that render the host prototrophic for a given nutrient and 
genes that encode enzymes that produce a color or fluorescent product when exposed to 
5 the appropriate substrate. 

In one embodiment, yeast clones harboring plasmid partners that encode 
interacting polypeptides are selected by growing the fiised two-hybrid host on medium 
lacking the nutrient required in the absence of transcription of the gene encoding the 
selectable marker. In another embodiment the fused two-hybrid hosts are grown on 
10 medium containing the appropriate colorimetric or fluorogenic substrate for the enzyme 
encoded by the selecatable marker gene. 

The two-hybrid hosts can be fused in batch. In one embodiment, the fused two- 
hybrid hosts are plated on the selective medium such that indvidual colonies are derived 
from individual fused hosts. Positive clones can be picked or isolated by hand or by 
1 5 robotic methods. Colony picking robots are well known in the art. In another 

embodiment, the fused two-hybrid hosts are contacted with the fluorgenic substrate and 
positive cells are selected using a fluorecence activated cell sorter. Methods of 
fluorescence activated cell sorting are well known in the art. 

Suitable hosts for the two-hybrid screen of the present invention include any 
20 transformable organism that can be grown as a single-celled organism. Such hosts 
include prokaryotes and eukaryotes. hi particular eukaryotic hosts can be yeast and 
mammalian or other cell culture. 

The arrays of the present invention can be generated using polynucleic acid 
libraries from any organism of interest, including prokaryotic, archebacterial and 
25 eukaryotic organisms. 

The present invention is drawn to a composition comprising an array of plasmids 
comprising two or more plasmid partners wherein a first plasmid partner comprises a 
first library fused to a nucleic acid encoding a DNA binding domain, a second plasmid 
partner comprises the first or a second library fused to a nucleic acid sequence encoding 
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a transcriptional activation domain, wherein the first and second plasmid partners are 
selected to be in the array by their abihty to, in concert and in the absence of expression 
of said post-translational modifying enzyme, activate transcription of one or more 
marker genes in a host cell, wherein the post-translational modifying enzyme is encoded 
by a third plasmid partner in the host cell. 

The present invention is drawn to a composition comprising an array of plasmids 
comprising two or more plasmid partners wherein a first plasmid partner comprises a 
first library fused to a nucleic acid encoding a DNA binding domain, a second plasmid 
partner comprises the first or a second library fused to a nucleic acid sequence encoding 
a transcriptional activation domain, wherein the first and second plasmid partners are 
selected to be in the array by their ability to, in concert and in the presence of expression 
of said post-translational modifying enzyme, activate transcription of one or more 
marker genes in a host cell, wherein the post-translational modifying enzyme is encoded 
by a third plasmid partner in the host cell. 

Methods for reducing the false negatives include making different libraries. For 
example, random primed cDNA two-hybrid libraries can be constructed to obtain small 
protein domains which may be buried in the intact proteins in a specific condition. 
Second, the protein fusion interface can be changed. Traditionally, the DNA binding 
domain and activation domain are located in the N-terminus of the fusion protein. New 
libraries can be constructed with the DNA binding domain and activation domain located 
at its C-terminus, so that the N-terminus of the bait protein can be fi-ee for its 
interactions. Third, the libraries for the first and second plasmid partner can be enriched 
for full-length genes. Furthermore, several "cytoplasm two-hybrid systems" have been 
developed. Cytoplasm two-hybrid systems can be integrated into the 3PTH system to 
cover those proteins which do not interact properly in the nucleus. 

The invention will be further illustrated by the following non-limiting example. 
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EXAMPLES 

Example 1 : Three-Protein Two Hybrid Screen 
Plasmid Vectors: 

Two vectors with three drug resistance genes are constructed. Each vector carries 
an unique E.coli selection marker such as Zeocin or DHFR (DHFR represents 
DiHydroFolate Reductase and confers resistant to Trimethoprin). The vectors also carry 
an additional common selection marker, P-lactamase, The drug resistant gene specific to 
each vector increases the efficiency of recovering the plasmids that are positive. The 
cycloheximide counterselection system (Harper et al. Cell 75:805-816 (1993)) can be 
used to optimize selection. 

The DNA-binding domain (DB) vector (or first plasmid partner) is constructed 
by inserting the Zeocin resistant gene into pGBT9 (Bartel et al. Methods Enzymol, 
254:241-263 (1995)) or pDBTrp (Vidal, Bartel and Fields, Eds. Oxford Univ. Press 109, 
(1997)). pACT2 constitutes the basis of the activation domain (AD) vector (second 
plasmid partner) with the addition of the DHFR gene. pGBT9 and pDBTrp are selected 
because they yield low levels of false positives. While not wishing to be bound by 
theory, this may be due to their low level of gene expression. The main source of false 
positives usually originates from the activation of the DB-vector reporter gene by itself 
pDBTrp is a centromere-based (low copy number) expression plasmid with the full 
length ADHl promoter. pGBT9 is a two micron-based (high copy number) expression 
vector with a truncation to give a minimal activity ADHl promoter. 

Yeast Strains: 

The promoter strength of the reporter gene and the expression level of the two 
hybrid proteins determine the sensitivity of two-hybrid system. Thus, the selection of the 
yeast host strain is critical for success. In general, the upstream activating sequence 
(UAS) of GALl is stronger than GAL2 UAS , and GAL2 UAS is stronger than the 
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synthetic GAL4 binding site consensus sequence (UAS G17-mer). The available host 
strains shown below are compared and the optimal pair is selected. 

MATa, tipl-901, leu2-3,112, ura3-52, his3-200, gal4, gal80, 
LYS2::GAL1uas-GAL1tata-HIS3,GAL2uas-GAL2tata-ADE2 
(James etal. Genetics 144:1425-1436 (1996)) 

MATa, ura3-52, his3-200,ade2-101, trpl-901, leu2-3,112, met-, gal4, 
galSO, URA3:: GALluAs-GALlTAiA-lacZ {Harper et al.. Cell, 75:805-816 
(1993)). 

MATa, leu2-3,112, trpl-901, his3_200, ade2-101, gal4, gal80, 
SPAL10::URA3, GALl(GALluAs)::lacZ, HIS3(GAL1uas)::HIS3 @LYS2 
(Vidal etal., Proc. Natl. Acad. Sci. USA P5:10315-10328 (1996)). 

MATa, leu2-3,112, trpl-901, his3_200, ade2-101, gal4, gal80, 
SPAL10::URA3, GALl(GALluAs)::lacZ, HIS3(GAL1uas)::HIS3 
@LYS2 (Vidal, 1997 ) 

1 5 The host strain pair P J69-2A A^l 87 uses two different promoters (GALl and GAL2) on 
three different reporters (HIS3, ADE2, lacZ). The yeast pair MaV103 and MaV203 has 
SPAL10(UAS G17-mer) and GALl as the promoters in front of reporters URA3, HIS3, 
and lacZ. The sensitivity of SPALIO (UAS G17-mer) promoter to GALl and GAL2 is 
compared by using a group of known interacting proteins with different affinities (Table 
20 I), then selecting the strain with stronger promoters. 



mi 



PJ69-2A: 



Y187: 



MaV103: 

10 



MaV203: 
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TABLE I : Examples of Known Interacting Pairs 





Hybrid #1 


Hybrid #2 


Interaction Strength 


Pair 1 


human RB 
(aa302-928) 


human E2F 
(aa342-437) 


weak 

(Vidal et al., Proc. Natl. Acad Sci 
USA. P5(7P;:10315-20 (1996) 


Pair 2 


Drosophila DP 
(aal-377) 


Drosophila 
(aa225-433) 


moderate 
(Du et al., Genes Dev. 
I0(10):1206-1S (1996) 


Pair 3 


cFos (aal32-211) 


cJun (aa250-325) 


strong 

(Chevray & Nathans, Proc. Natl. 
Acad. Sci., USA., 89 (1 3):5789-93 
(1992)) 


Pair 4 


murine p53 
(aa72-390) 


SV40 T antigen 
(aa87-708) 


moderate 
(Iwabuchi et al.. Oncogene, 
8(6):l693-6 (1993)) 


Pairs 


yeast SNFl 


yeast SNF4 


weak 

(Fields & Song, Nature 340 : 245-6 
(1989)) 


Pair 6 


murine SNK 


himian CEB 


moderate 
(Yuan & Erikson, impublished) 



Library Construction: 

A series of brain cDNA libraries is constructed using AlphaGene's normalization 
10 and FLEX™ (Full-Length Expressed gene) cDNA library construction technologies U.S. 
Serial No.s: 09/062,452, the teachings of which are incorporated herein in their entirety. 
Plasmid pGBT9-Zeo has been constructed and tested by constructing a human fetal 
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brain library with a titer of 1 .8 x 1 0^ primary clones. The libraries were normalized. 
Table n summarizes the results of a-tubulin and P-actin abundance comparisons in two 
cDNA libraries, constructed from the same fetal brain mRNA. The protocol for library 
normalization is an improvement of the Sive and St. John protocol (Sive & St. john, 
5 1988;). A short hybridization, corresponding to an estimated Cot of four, was carried out 
before addition of streptavidin to remove double-stranded DNA. Hybrid selection with 
alpha tubulin and beta actin was performed, followed standard procedures. Li each 
hybrid selection an average of two thousand colonies were examined. 



TABLE U : Library Normalization Data 





Normalized Library 


Control Library 


Improvement 


a-tubulin (%) 


0.1 


0.9 


9X 


p-actin (%) 


0.1 


0.7 


7X 



Pre-screening: 

The number of false positive clones is reduced by performing a prescreen. URA 
counterselection is performed to remove the false positive signal from the DNA-binding 
15 domain alone. The DNA-binding domain library are transformed separately into strain 
MaV203 and ura" colonies are selected on 5-FOA plates. All surviving colonies are used 
for the two-hybrid interaction screening procedure. 

Literaction Screen: 

Yeast mating (Bendixen et al. Nucleic Acids Res. 22(9):inS'9. 1994) and 
20 plasmid transformation followed by nutritional selection are used in the two-hybrid 
interaction screen. The DNA-Binding domain (DB) library is transformed into strain 
PJ69-2 (a mating type a strain) and the Activation Domain (AD) library transformed into 
strain Yl 87 (a mating type a axpaxv). The a and a transfomiants are mated with 
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sub sequent nutritional selection. Plasmids that grow after nutritional selection are 
isolated. Optimization of the mating/transformation step is critical because yeast cells, 
unUke E, coli, can acquire multiple plasmids following transformation. The amount of 
DNA used in transformation is varied to alter the number of plasmids transformed into 
cells. 

For an initial test, pilot scale experiments are performed using two different 
approaches. Tests with 10 known interacting pairs with various affinities are conducted. 
The interactions among these 10 pairs are studied by 1) A matrix mating — the 
interaction of every possible pair (100 pairs in combination is examined); 2) A "library 
vs. library" or batch screen — the 10 clone pairs are mixed as two "mini-libraries" (one 
DB library and one AD library) followed by an interaction screen. The percentage of 
false positives and false negatives is determined for each selection marker. 

Following the initial tests, a small scale genome wide interaction screen is 
performed with a pair of two-hybrid FLEX™ cDNA libraries. The ADE selection 
marker was chosen. To tighten the screen an additional marker, the coli lacZ gene can 
be used. There are two possible alternatives to select for the clones, hi one scenario, the 
clones surviving nutritional selection are robotically picked to 96 well plates, followed 
by a liquid p-galactosidase assay with a chemiliiminescent substrate (Campbell et al, 
1995). Li the second scenario, a P-galactosidase filter assay is performed and the blue 
colonies are robotically distributed in 96 well plates. 

Plasmid Retrieval 

Isolation of plasmids fi-om yeast is not trivial The problem is particularly difficult 
when working with large plasmids (>6 kb). Low yields and genomic contamination are 
common. To rapidly isolate the plasmid, 1 .5 ml of saturated yeast cells were spun down 
and lysed in 10 |il of Lyticase for 60 min at 3T C, then 10 jil of 20% SDS was added 
with vigorous vortexing to help the cell lysis. The cells were put through one 
freeze/thaw cycle to ensure complete lysis. The whole cell lysate was passed through a 
spin column. The colunm beads of the high throughput spin column were purchased 
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from Phannacia Biotech (Sephacryl S-1000), The eluate from the spin column 
containing the purified DNA was collected for the transformation into E. coli for 
amplification. After the plasmid is isolated from yeast, the DNA is transformed back 
into E.coli for amplification. A multi-head electroporator from BTX, Genetronics, is 
used to increase the throughput. The two vector/three selectable markers system 
provides for efficient plasmid recovery. 

Confirmation 

Multiple E. coli transformants are picked, plasmid DNAs are isolated, and 
transformed back into yeast strains to confirm the interactions. Confirmation is 
necessary since yeast can carry multiple plasmids. It is worth noting that optimization of 
the plasmid transformation procedure may significantly lower the hkehhood that a yeast 
cell carries more than one type of plasmid. 

Construction of Arrays and Identification of Interacting Clones by Hybridization 
Amplification of the DNAs 

A library versus library two-hybrid screen was performed as described above. The 
DNA-binding domain library was prescreened to remove clones that can activate the 
reporters in the absence of protein-protein interaction. Clones that passed the prescreen 
were mated with clones from an activation domain library. A portion of the mated cells 
were selected for those carrying protein-protein interactions. Plasmids from a portion of 
the selected colonies were retrieved from yeast cells and amplified in E. coli then 
extracted using standard molecular biology protocol. 

One pair of interaction plasmids plus 10 known plasmids were spotted onto 
microarray slides in a duplicate fashion. Two independent clones from the 10 known 
genes were labeled with fluorescent CY3 or Cy5 for use as probes to determine whether 
the spotted plasmids can be correctly identified on microarray slides. Approximately 1 
|j,g of each plasmid DNA was resuspended in 5X SSC buffer for printing (spotting) onto 
the slides. The printing procedure was followed according to the manufacturer's 
instructions. Approximately 6ng of plasmid DNA was printed onto a single spot in 
duplicate. 
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Table VI 



Locations 
(duplicate) 


Plasmid Clone 
Name 


Insert Description 


Comment 


A1,B1 


YY367-1 


Bll inpGBT9 
vector 


Bll is a novel gene 
previously identified 
as interacting with 

bnk (at A4 and B4) 


A2, B2 


F2 


F2inpGBT9 


Interacts with F14 at 
A5 and B5 


A3, B3 


Alpha 4 


Alpha4inpGBT9 


Interacts with PP6 at 
Ao and Bo 


A4, B4 


YY89-1 


Snk in an activation 
domain vector 
(pGAD424) 


Snk, a protein kinase, 
interacts with Bl 1 at 
Al andBl 


A5, B5 


F14 


F14 in pGAD 

vector 


Interacts with F2 at 
A2 and B2 


A6, B6 


PP6 


PP6inpGAD 
vector 


Interacts with Alpha 1 
At A3 and B3 


A7, B7 


YY313-9 


Bll inpGAD 
vector 


Bll is a novel gene 
previously identified 
as interacting with 
Snk (at A4 and B4) 


A8, B8 


TD-1 


SV40 large T- 
antigen in pACT2 




A9, B9 


B75 


B75 in pGAD 
vector 




AID, BIO 


A18 


A18 inpGAD 
vector 




C1,C3 


2-hybrid clone 1 


An unknown clone 
in DAN-binding 
domain vector 


interacts with clone2 
at C2 and C4 


C2, C4 


2-hybrid clone2 


An unknown clone 
in activation 
domain vector 


interacts with clone 1 
at CI and C3 



-25- 



Fluorescent Probe Synthesis 

Genes Bl 1 and Snk were excised from clones YY313-9 and YY8-1 1 respectively 
and the inserts were isolated from agarose gel by centrifugation. Approximate 25ng of 
denatured DNAs were labeled by Cy3 and Cy5 dCTP (purchased from Amersham 
Biotech) in a reaction containing random primer mixture and reaction buffer (both 
provided by High Prime DNA Labeling Kit from Boehiinger Mannheim), 25|aM 2 - 
deoxyadenosine-5' - triphosphate, 25|j,M 2'thymidine-5 -triphosphate, 25iiM 2 - 
deoxyguanosine-5'-triphosphate, 5\iM 2 -deoxycytidine-5'-triphosphate and 20|a,M Cy3 or 
Cy5 labeled 2'-deoxycytidine-5 -triphosphate and 4 U of Klenow polymerase. The 
reaction was incubated at 37° C for 45 min and stopped by incubating at 65° C for 10 
min. The probes were purified by standard ethanol precipitation and resuspended in 10 
|il of hybridization buffer (6 X SSC, 5X Denhart's solution, 2% SDS, 0.1 [ig/fil of yeast 
tRNA). 

Hybridization 

2 |al each of Cy3 probe and Cy5 probe were combined for hybridizing the DNA on 
each of glass slide. The slides were placed in sUde chambers with a towel wet with 2 X 
SSC. They were brought up to 80 ° C for 1 0 min and then immediately put on ice to 
denature the DNA, hybridized at 62° C for 6 hours, then washed to remove the 
unhybridized probes by 2X SSC, dried, and scanned by GenePix 4000 Microarray 
Scanner from Axon Instruments. 
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Results 

1 . To evaluate the spotting procedure, a Cy5 or Cy3 labeled probe containing 

common vector sequences was hybridized to the DNA on glass slide. Figure 4 
showed that all DNAs were attached onto slide. 

5 2, To localize the clone Snk 

A Snk probe either labeled by Cy3 or Cy5 was hybridized to the slide. Figure 5 
shows Snk probe hybridizes both A4 and B4 DNAs which are the Snk clones. 

3 , To locaUze the clone B 1 1 : 

A Bl 1 probe either labeled by Cy3 or Cy5 was hybridized to the slide. Figure 6 
10 shows the B 11 probe hybridizes locations Al, Bl, A7, B7 (location 1 represents 

Bl 1 in pGBT9 and location 7 represents Bl 1 in the pGAD vector). 

Thus, a specijSc DNA probe can correctly identify homologous DNA on an array with 
little or no background. For example, if Snk is the polypeptide of interest, a labeled Snk 
polynucleotide is used to probe the interaction array. The hybridization identifies A4 
1 5 and B4. From the hnkage information shown in Table VI, the clone at Al and B 1 (B 1 1) 
is identified as a Snk interacting clone. The hnkage information is provided by the 
arrays of the present invention. This method can be used repetitively with newly 
identified interacting clones as the probes, to draw a protein interaction map and 
establish biological interaction pathways. 

20 
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Example 2: Construction of Two Hybrid Arrays 
Plasmid vectors: 

A new vector is constructed to be used as the third plasmid partner. This vector 
carries URA3 marker, a yeast 2\x origin of repHcation, a yeast nuclear localization signal, 
5 multiple cloning sites, ampicillin section marker and Col El origin of replication. A 
yeast inducible promoter (Met25), A constitutive promoter (pADH) can also be used. 

Yeast Strains: 

A new oc mating type strain is constructed from Y187 the URA3 phenotype is 
reverted to ura3 by 5-Fluoroorotic Acid (5-FOA) counterselection. The desired geontype 
10 is (Mat<^, ura3, his3, ade2, trpl, leu2, met, gal4A, galSOA, ura3::GALl^^-GALlTATA- 
lacZ). PJ69-2A (a mating type A strain) will be used in the mating assay as the second 
plasmid partner. 

Library construction: 

A series of brain cDNA libraries is constructed using technology described U.S. 

15 Patents 5,162,290, 5,643,766 and Serial No.: 09/062,452, the teachings of which are 
incorporated herein in their entirety . The bait library is cloned into the pGBT9 derived 
vector and the prey library is cloned into the PACT2-derived vector. Kinases are chosen 
for the third plasmid partner in 3PTH. Kinases are chosen to obtain sufficient activity. 
Kinases are selected based on the following criteria, either separately applied or applied 

20 in combination: (1) kinases that have been overexpressed in the host cell in the past; (2) 
kinases whose constitutive and/or inactive forms are available; (3) kinases which have 
homologous pathways in yeast (allowing activation in yeast by an endogenous activator 
if necessary); (4) kinases which are expressed in a tissue of interest (for example, brain 
tissue). The nucleic acid encoding the kinase or kinases are expressed under the control 

25 of an inducible promoter such as the MET25 inducible promoter. 
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The presence or absence of kinase activity under specified conditions is optimized 
using positive control pairs of proteins as shown in Table m. The first column 
represents the first hybrid protein, the second column represents the second hybrid 
protein, the third column represents the third plasmid partner (kinase) and the fourth 
5 column represents the affinity of interaction after the hybrid protein is phosphorylated by 
the kinase listed in column 3. 



Table IE 



Hybrid 1 


Hybrid 2 


Kinase 


Interaction 
Lever 


CREB 


CBP 


PKA 


increase 


IgE receptor 


SH2-B 


Syk or Lyn 


increase 


RGSZl 


Gzalpha 


PKC 


increase 


HsEgS 


dynactin (pi 50) 


P34cdc2 


increase 


NMDA receptor 


calmodulin 


PKC 


decrease 


Mu2 


CTLA-4 


PI3 


decrease 



15 1. of phosphorylated form 



3PTH Library Screening: 

Both bait and prey fusion protein containing plasmids are transformed into one 
haploid yeast strain (MATa, for example) and the kinase containing plasimid(s) are 
transformed into the other haploid strain (MATa). Figure 3 is a flow chart of the 3PTH 
20 system. 



Screen for Literactions that Occur Only in the Unphosphorylated Form: 

Both the pGBT9 based library and the PACT2 based library DNA are transformed 
into ura3 MATa cells having the ade* genotype. White (ADE"^) colonies are picked and 
arrayed onto 96 well plates by a robot. A control panel, including positive and negative 
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controls are also placed onto each plate. The arrays of cells are grown and replica plated 
onto the desired number of plates (e.g. the number of kinases to be screened plus one 
plate for a negative control of empty kinase vector). The kinase vectors (carrying the 
URA3 marker) are transformed into MATa cells. 

The MATa cells and MATa cells, transformed as described above, are mated in 
batch fashion. White colonies are selected and the GBT9 and pGAD424 fusion plasmids 
are recovered by Zeocin and Ampicillin selection, respectively, using standard plasmid 
isolation techniques. The desired clones have the following pheno types: 

Table IV 







GBT9 alone 


red 


GAD424 alone 


red 


GBT9 + GAD424 


white 


GBT9 + GAD424 + kinase 


red 



DNA encoding proteins that interact with a polypeptide of interest only in the absence of 
phosphorylation are selected and isolated by probing the array of plasmids recovered 
above with DNA encoding the polypetide of interest. 

Screen for Interactions that Occur Only in the Presence of Phosphorylation 

The initial screen for interacting proteins is performed as describe above except 
red transformed MATa colonies are picked by the robot. The kinase transformed MATa 
cells are mated to the selected, transformed MATa cells in batch fashion. The HIS+ and 
ADE+ colonies are arrayed onto 96 well plates. The arrays are replicated onto two 
plates. On one plate, kinase expression is turned off by adding methanol to tum of the 
MET2 promoter, or by adding 5-FOA to counter select the URA plasmid. The cells on 
the other plate are allowed to express the kinase. Colonies that are white on the kinase"^ 
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plate and red on the kinase' plate are selected. Plasmids are recovered as described 
above. The desired clones have the following phenotypes: 

Table V 



Plasmid 


Color in 3PTH Assav 


GBT9 alone 


red 


GAD424 alone 


red 


GBT9 + GAD424 


red 


Kinase + GBT9 


red 


Kinase + GAD424 


red 


GET + GAD424 + kinase 


white 



DNA encoding proteins that interact with a polypeptide of interest only in the presence 
of phosphorylation are selected and isolated by probing the array of plasmids selected 
above with DNA encoding the polypetide of interest. 

EQUIVALENTS 

15 Those skilled in the art will recognize, or be able to ascertain using no more than 

routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 



